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OO ' Abstract. — Wc study the geometry of the space of measures of a 

compact ultrametric space X, endowed with the L^ Wassertein distance 
from optimal transportation. We show that the power p of this distance 

.^ . makes this Wasserstein space affinely isometric to a convex subset of £^ . 

["t ' As a consequence, it is connected by ^-Holder arcs, but any a-H61der 



arc with a > - must be constant. 

This result is obtained via a reformulation of the distance between two 
measures which is very spectific to the case when X is ultrametric; how- 
ever thanks to the Mendel- Naor Ultrametric Skeleton it has consequences 
even when X is a general compact metric space . More precisely, we use 
it to estimate the size of Wasserstein spaces, measured by an analogue 
^ ' of Hausdorff dimension that is adapted to (some) infinite-dimensional 

spaces. The result we get generalizes greatly our previous estimate that 
-vj ' needed a strong rectifiability assumption. 

^y-^ I The proof of this estimate involves a structural theorem of indepen- 

l' ' dent interest: every ultrametric space contains large co-lipschitz images 

(^ [ of regular ultrametric spaces, i.e. spaces of the form {!,..., k}^ with a 

fT^ ' natural ultrametric. 



j^ i 1. Introduction 

Given a metric space X, that we shall always assume to be compact, 
one can define its L^ Wasserstein space #^(X) = {^{X),Wp) as the 
set of its Borel probability measures ^(X) endowed with a distance Wp 
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defined using optimal transportation (see below for precise definitions). 
In some sense, >^(X) can be thought of as a geometric measure theory 
analogue of U* space, although its geometry is finely governed by the 
geometry of X as Wp involves the metric on X in a crucial way; in 
particular, the great variety of metric spaces induces a great variety of 
Wasserstein spaces. As a consequence, the natural affine structure of 
^{X) (i.e., its afiine structure as a convex in the dual space of continuous 
functions) is in general only losely related to the geometric structure of 

Wp. 

The links between optimal transportation and geometry have been the 
object of a lot of studies in the past decade. In a series of paper we try 
to understand what kind of geometric information on ^(X) can be ob- 
tained from given geometric information on X. We considered for exam- 
ple isometry groups and embeddability questions when X is a Euclidean 
space |KlolO] or, with Jerome Bertrand, a Hadamard space |BK12] , and 
the size of Wp{X) when X is (close to be) a compact manifold |Klol2] . 

Here we consider the case when X is a compact ultrametric space, i.e. 
satisfies the following strenghtening of the triangular inequality: 

d{x,z) ^ max((i(x,?/), (i(|/, 2;)). 

Examples of compact ultrametric spaces include notably the set of p-adic 
integers Zp or more generally the set {!,..., /c}^ of infinite words on an 
alphabet with k letters, endowed with the distance d{x, y) = q-'^^^b'^i^^Vi} 
where g > 1 and x = {xi, X2, ■ ■ ■), y = (2/1, y2, ■ ■ ■)■ We shall call these 
examples regular ultrametric space and denote them by Y{k,q). 

1.1. Embedding in snowfiaked £^ . — Ultrametric spaces are in some 
sense the simplest spaces in which to do optimal transportation, thanks 
to the very strong structure given by the ultrametric inequality. We are 
therefore able to give a very concrete description of Wp{X). 

Theorem 1.1. — If X is a compact ultrametric space, then (^(X), Wp) 
is affinely isometric to a convex subset of i^ . 

Another way to state this result is to say that WpiX) is affinely iso- 
metric to a convex subset of i^ endowed with the "snowfiaked" metric 
II ■ 11;^. Note that the existence of such an embedding, both affine and 
geometrically meaningful, of Wp{X) into a Banach space seems to be 
quite exceptional; the closest case I know of is ;;^(]R), which is isomet- 
ric to a subset of increasing functions in L^([0,1)], but even there the 
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isometry is not affine. In fact, the decorrelation between the affine struc- 
ture of ^{X) and the geometry of Wp is an important reason for the 
relevance of Wasserstein spaces, as the very fact that geodesies in the 
space of measures (when they exist) are usually not affine line made it 
possible to define a new convexity assumption that turned out to be very 
successful, see notably |McC97] . In this sense. Theorem 11.11 is a neg- 
ative result: when X is ultrametric there is little more to ;;^(X) than 
to j^{X), as far as we are concerned with notions which are affine in 
nature (e.g. convexity). We shall see, however, that this result has nice 
consequences. 

As is well-known, snowflaked metrics are geometrically very discon- 
nected (all rectifiable curves are constant) and this affects the geometric 
connectivity of Wasserstein space. 

Corollary 1.2. — If X is a compact ultrametric space, Wi{X) is a 
geodesic space, and for p > 1, #^(X) is connected by --Holder arcs 
but any a- Holder arc with a > - must be constant. 

1.2. Size estimates. — While their strong structural properties make 
ultrametric space feel quite easy to deal with, they also happen to be 
ubiquitous, as shown by the ultrametric skeleton of Mendel and Naor 
|MN13al IMN13b] : very roughly, any metric space contains large al- 
most ultrametric parts. This powerful result enables us to control very 
precisely the size of very general Wasserstein spaces. 

Theorem 1.3. — Given any compact metric space X (not necessarily 
ultrametric) , we have 

crit,92 Wp{X) ^ dimX. 

Here, dim denotes the Hausdorff dimension, and crit^ is the power- 
exponential critical parameter introduced in |Klol2] . which is an exten- 
sion of Hausdorff dimension that distinguishes some infinite-dimensional 
spaces. This bi-Lipschitz invariant is constructed simply by replacing 
the terms e'^ by exp(— e~*) in the definition of Hausdorff dimension. In 
particular, the above result implies that to cover Wp{X) one needs at 
least (very) roughly exp(e~'^™^) balls of radius e. 

As we proved in jKlol2] that crit,^? y^p{X) is at most the upper Min- 
kowski dimension M-dimX of X, the following results follow at once 
from Theorem II. 3[ 
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Corollary 1.4- — Let X he any compact metric space (not necessarily 
ultrametric) ; z/dimX = M-dimX = d, then crit^ WpiX) = d. 

This greatly generalizes two of the main results of [KIol2] . focused on 
p = 2 and where either instead of dim X = d we had to assume the much 
stronger assumption that X contains a bi-Lipschitz image of [0, l]'^, or 
we could only conclude a much weaker lower bound on the size of W2{X) 
(even weaker than crit^ #2(-^) > 0). 

Corollary 1.5. — Let X,X' be two compact metric spaces. If dim X > 
M-diniX', then there is no bi-Lipschitz embedding #^(X) — )■ #^/(X'), for 
any p,p' G [1, oo). 

Let us also note that the Mendel-Naor ultrametric skeleton Theo- 
rem readily implies that for all e > 0, ^(X) contains a subset S with 
crit^ S > {1 — e) dimX that embeds in an ultrametric space with dis- 
torsion 0{l/e). 

It would be more natural to replace the inequality on dimensions 
in Theorem 11.31 bv dimX > dimF, but our method cannot give that 
stronger statement. This seems inevitable since the Hausdorff dimension 
of X gives no upper bound on the critical parameter of its Wasserstein 
spaces. 

Proposition 1.6. — There is an ultrametric space X such that X is 
countable (in particular dimX = 0^ but crit,^ Wp{X) = +oo for all p. 

1.3. Structure of ultrametric space. — In addition to the Mendel- 
Naor theorem, one important ingredient in the proof of Theorem 11.31 is a 
structural result that seems of interest in itself, according to which every 
compact ultrametric space contains (in a weak sense) a large regular part, 
which is easier to deal with. 

Theorem 1.7. — Given any compact ultrametric space X and any s < 
dimX, there is a regular ultrametric space Y = Y{k,q) of Hausdorff 
dimension at least s and a co-Lipschitz map ip : Y ^ X . 

By a co-Lipschitz map we mean that for some c > and all a,b E Y 
one has 

d{(p{a), f{b)) ^ c ■ d{a, h). 

This shows that, a bit like ultrametric spaces are ubiquitous in metric 
spaces, regular ultrametrics are ubiquitous in ultrametrics (and therefore 
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in metric spaces). The fact that we only obtain a co-Lipschitz map is a 
strong hmitation, but this is sufficient for our present purpose. 

1.4. Organization of the article. — In the next section, we intro- 
duce briefly some classical definitions and facts concerning Wasserstein 
spaces, ultrametric spaces and we recall some properties of critical pa- 
rameters. In Section[3l we prove Theorem 1 1.1 1 and corollary ll.2[ SectionH] 
is devoted the purely ultrametric Theorem II. 7^ while the closing Section 
O deals with Theorem II. 3^ its corollaries and the example in Proposition 

2. Preliminaries 

2.1. Wasserstein spaces. — We limit ourselves in this short presen- 
tation to the case of a compact metric spaces X whose distance is denoted 
by d. The set of Borel probability measures ^{X) is naturally endowed 
with a set of "Wasserstein" distances that echoe the distance of X: for 
any p G [1, +oo), one sets 

W„(/i,z/)=( inf I d(a,6Fn(dad6)V 
Vner(M,^)ixxX / 

where r(/i, v) is the set of transport plan, or coupling of (/i, z/), that is to 
say the set of measures 11 on X x X that projects on each factor as /x 
and z/: 

n(v4xX) =/i(A), n(Xxfi) = z/(fi) V Borel A,5cX 

In other words, a transport plan specifies a way of allocating mass dis- 
tributed according to /x so that it ends up distributed according to z/; its 
LP cost 



-p 



(n) := inf / t/(a,&)Pn(dad6) 



is the total cost of this allocation if one assumes that allocating a unit of 
mass from a point to a point d away costs (i^, and Wp(Ai, vY is the least 
possible cost of a transport plan. 

It is easily proved (see e.g. |Vil09] for this and much more) that the 
infimum is realized by what is then called an optimal transport plan, 
that Wp is indeed a distance and that it metrizes the weak topology. We 
shall denote by Wp the metric space (^(X),Wp) and call it the {U) 
Wasserstein space of X. 
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We shall need very little more from the theory of optimal transporta- 
tion, let us only state two further facts. 

First, an easy consequence of "cyclical monotonicity" is that if X is 
a metric tree (or its completion), e is an edge of X, 11 is an optimal 
transport plan between measures /i, z^ G Wp{X) supported outside e, and 
Xi, X2 are the connected components of X \ e, then 

n(Xi X X2) = max(/i(Xi) - i^(Xi),0); 

in other words, no more mass is moved through an edge than strictly 
necessary. 

Second, the concept of displacement interpolation shall prove conve- 
nient. If X is a geodesic space, /i, i^ G #^(X) and 11 is an optimal 
transport plan (implicitely, for the cost Cp and from fi to u), then it is 
known that there is a probability measure tt on the set of constant speed 
geodesies [0, 1] — )■ X such that if one draws a random geodesic 7 with 
law 71, the random pair (7(0), 7(1)) of its endpoints has law 11. In other 
words, if Ci : 7 I— )■ j(t) is the specialization map, 11 = (eo,ei)#7r. The 
measure vr is called an optimal dynamical transport plan. The interest of 
this description is that Wp{X) is geodesic, and all its geodesic have the 
form (ef#7r)fg[o_i] for some optimal dynamical transport plan vr. 

Now, if X is a metric tree, let say that two geodesies 71, 72 are an- 
tagonist if they both follow some edge e, in opposite directions. Then 
an optimal dynamical plan vr between any two measures is supported on 
a set without any pair of antagonist geodesies. This is of course closely 
linked to the cyclical monotonicity. 

2.2. Critical parameters. — Critical parameters where introduced 
in |KIol2j as bi-Lipschitz invariants similar to Hausdorff dimension but 
that can distinguish between some infinite-dimensional spaces, notably 
many Wasserstein spaces. 

We shall not recall their construction here, but only a few facts that 
we shall use in the sequel. First, to define a critical parameter one needs 
a so-called scale, a family of functions playing the role played by (r i— )■ r*) 
in the definition of Hausdorff dimension. We restrict here to the power- 
exponential scale ^ = (r I— )■ exp(— l/r'*))s>o and denote the correspond- 
ing critical parameter by crit ■3^. 

Then the Frostman Lemma (see e.g. |Mat95j ) gives a characterization 
of crit,^, from which we extract the following conditions: 
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— if crit,92 > s then there is /i G j3^{X) and C > such that for all 
X e X and all r > 0, fi{B{x,r)) < C exp{—l/r^); 

— if there is a measure // G ^(X) as above, then crit^ > s. 

In other words, X has large critical parameter if it supports a very spread 
out measure, crit^k) X is zero when X has finite Hausdorff dimension, but 
turns out to be non-zero for many interesting infinite-dimensional spaces. 
A similar statement holds for Hausdorff dimension. 

The second important fact is that crit^ can only increase under co- 
Lipschitz maps: ii f : Y \-^ X satisfies d{f{a), f{b)) > c • d{a, b) for some 
c > and all a,b &Y, then crit,^(X) > crit,^(F). To prove lower bounds 
on critical parameters, our strategy will be to find in our space of interest 
co-Lipschitz images of spaces supporting a well spread-out measure. 

To do that, we will need spaces on which such measures are easy to 
construct. We shall use the Banach cubes 

BC((a„)„) := {{xn) e £^ I < x„ < a„Vn G N} 

defined for any d.^ positive sequence (a„). In |Klol2] . BC((a„)„) was 
denoted BC([0, 1], 1, (a„)„) as a more general family of Banach cubes 
was defined. It can be proved that 

crit^BC((n-")) = —^ 
a — 1 

for all a > 1, in the same way used in |Klol2] to prove the Hilbertian 
version of this estimate. 

2.3. Ultrametric spaces. — According to the Ultrametric skeleton 
Theorem |MN13al IMNlSb] , given any compact metric space X and 
any s < dimX, there is an ultrametric space X' of dimension at least s 
and a bi-Lipschitz embedding X' ^ X (moreover, the distorsion of this 
bi-Lipschitz embedding is 0(dimX — s)~^)). This simply stated result 
is very powerful, see |Naol2] . 

The other, older and classical fact we shall need about ultrametric 
spaces is their description in terms of trees (see e.g. |Naol2] Section 8.1 
or IGNSOOj ). 

By a tree we mean a simple graph T with vertex set V and edge 
set E, which is connected and without cycle; it can be infinite but is 
assumed to be locally finite. A tree is rooted if it has a distinguished 
vertex o, which is not a leaf, and called the root. A vertex v ^ a has 
a unique parent v*, defined as the neighbor of v closer to o than w; f is 



8 BENOIT R. KLOECKNER 

then said to be a child of v*. Each edge has a natural orientation, from 
parent to child: {v*v) is said to be a positive edge. A height function 
is a function h : V ^ [0, +00) that is decreasing: h{v) < h{y*) for all 
V ^ o (note that our trees have the root on top). A synchronized rooted 
tree (SRT for short) is a rooted tree endowed with a height function 
such that for any maximal (finite or infinite) oriented path o, t"!, ^2, • • • , 
we have lim/i(t'„) = (in particular, all leaves have height 0). The 
metric realization of a SRT T is the metric space obtained by taking 
a segment of length h{x) — h{y) for each positive edge (xy) and gluing 
them according to the combinatoric of T. It is still denoted by T, and 
the height function can be extended linearly on edges and continuously 
to the metric completion T of T; this extension is still denoted by h. By 
construction, h~^{0) is the union of all leafs of T and of T \ T. It is not 
hard to check that h~^{0), endowed with the restriction of the metric of 
T, is ultrametric. We can now state the description alluded to above. 
Any compact ultrametric space can be isometrically identified with 
the level h~^{0) of the completion of a metric SRTT. 

For each vertex v of T, the set of points in X that can be reached 
by an oriented path from u is a metric ball of X, denoted by X^ and 
of diameter 2h{v). All balls of X are of the form X^ for some f , e.g. 
X = X,. 

The proof of the above folkloric fact is not difficult, and can be found up 
to little notational twists in the references cited above: one simply use the 
ultrametric inequality to partition X into maximal proper balls, which 
will be identified with the children of o, and then proceeds recursively. 



3. £^ coordinates 

The goal of this Section is to prove Theorem 11.11 given a compact 
ultrametric space X and p G [l,oo), we want to construct an affine 
isometry from (^(X), Wp) to a convex subset of £^, that is a map ip : 
^{X) — )■ (^ with convex image and such that 

^{t^i + {I - t)u) = t^{ii) + {I - t)^{u) Wfi,ue ^{X),\fte [0,1] 

and 

y{fi) - (/.(z/)||i = Wp(/i; lyf V/i, u e ^{X). 

Up to coefficients, this map is simply constructed by mapping a measure 
to the collection of masses it gives to balls of X. 
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3.1. A formula for the Wasserstein distance. — Let T be a metric 
SRT sucli tliatX = /I'^O) C T as explain in g231 Then ^{X) is the 
subset of ^(T) made of measures concentrated on X, and Wp is the 
restriction to j^{X) of the Wasserstein metric on T, also denoted by Wp- 
Given a geodesic 7 of T, let -£^(7) be the set of edges runned through by 
7 and ^+(7) be the topmost vertex on 7. Given e = {v*v) G E, set 
6h^{e) = h{v*y — h{vy. Last, given e G -E let e_ be its lower vertex and 
r(e) be the set of geodesies going through 7 in any direction (not to be 
confused with a set of optimal transport plan!). 

Lemma 3.1. — For all fiji' & B^{X), we have 

(1) Wp(/i,i^r = 2P-i Y. 5h^{v*v)\^l{X,)-v{X,\. 

Proof. — Let 11 G r(/i, v) be an optimal transport plan. For any vertex 
V & T, the components of T \ (vv*) intersect with X along X^ and 
X^ := X \ Xy. As noted in §2.1^ optimality implies that n(A„ x A^) = 
max(0,/i(Aj,) — viX^)) and n(A^ x X^) = max(0, i^(A^) — /i(Aj,)): the 
total amount of mass that moves between A„ and its complement is 
|/i(A,-K^„)|. 

Let TT be an optimal dynamical transport plan on T such that 11 = 
(co, ei)#7r. Then for all geodesic 7 between two points of A, we have 

d(7(0),7(l)r = (2%+(7))r = 2^-^ Y, ^h^ie) 

eeE{y) 

from which it follows 

Wp(/x,z/r = Cp(n) = y"rf(7(0),7(l)rvr(d7) 

= /"2P-1 Y Shf{e)7r{d-f) 
= 2^-i5^<5/.^(e)vr(r(e)) 

eeE 

(2) = 2P-i^5/iP(e)|/i(^e_)-K^e_)|. 

eeE 

which is ([I]). D 
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3.2. Proof of Theorem 11.11 — Identify the set of non-root vertices 
V \ {o} of T with the positive integers, so that i^ = L^{V \ {o}) (with 
the counting measure). Let ip : ^{X) — )■ i^ be defined by 



^(/i) = {2^-'5hP{v*v)^i{X,,)),^ 



j^o&V 



Lemma 13.11 shows that (p is an isometric embedding, and it is obviously 
affine. It extends as an affine embedding from the space of signed measure 
(in which j3^{X) is convex) to i^, so that ip has convex image: Theorem 
II. H is proved. 

For Corollary \1.'2\ recall the fact that a convex subset of i^ is geodesic 
(thus connected by Lipschitz arcs) and that for any metric space {Y,d), 
d^ defines a distance without any non-constant Lipschitz curve whenever 
f3 e (0,1) (which is folklore, see e.g. Lemma 5.4 in |BK12j . for the 
simple proof). 

Apply this to 



with /3 = 1/ap < 1 where, thanks to Lemma [3. 1[ we know that WS is a 

distance: we get that Wp is a distance without non-constant Lipschitz 
curve. This means precisely that Wp has no non-constant a-Holder curve. 



4. Regular ultrametric parts in ultrametric spaces 



Let us now prove Theorem 11.71 We are given a compact ultrametric 
space X and s < dimX, and we look for a regular ultrametric space 
Y = Y{k, q) with dimy > s and a co-Lipschitz map (p -.Y -^ X. 

Let T be a SRT such that X = h~^{0), and fix e > such that 
s' := s + e < dimX. We assume, up to a dilation of the metric, that 
diamX = 1. 

4.1. Defining k and q. — By Frostman's Lemma, there exists on X 
a probability measure /i and a constant C such that for all x E X and 
all r, we have 

(3) /i(5(x,r)) <Cr^'. 

Choose an integer k such that 3k > 3^ ^^ and 3k > 2C, and let q = 
{3kY^'^' . This choice of q ensures that Y{3k,q) has Hausdorff (and 
Minkowski) dimension equal to s', and the first bound on k ensures that 
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Y = Y{k,q) has dimension at least s. The second bound on fc is a 
technicahty to be used later. 

The SRT of Y is the /c-regular rooted tree with height function h{v) = 
ig~" whenever v is at combinatorial distance n from the root. Our strat- 
egy is now to change slightly the metric on X, then use // to transform 
T into a regular tree, while controling both distances from above and 
dimension from below. 

4.2. Changing heights. — The following Lemma is folklore. 

Lemma 4-1- — There is an ultrametric d' on X such that all halls of 
X have diameters of the form q~"' with integer n, and 

d{x, y) < d'{x, y) < q ■ d{x, y) Vx, y ^ X. 

Said otherwise, this lemma ensure that we can assume that h takes 
only the values ^q'"" {n G N) on vertices of T 

Proof. — Consider the SRT obtained from T by changing h{v) into the 
smallest \q~'^ larger than h{v). The level of h' is still naturally identified 
with X, the induced distances are no smaller than the original one, and 
they are larger by at most a factor q. D 

Note that the measure /i still satisfies (j3]) in the new metric with the 
same C. From now on, we assume X satisfies the conclusion of the above 
lemma. 

4.3. Regrouping branches. — Consider now the children wi, V2, . . . ,Vj 
of the root, and let Vi, . . . , Vj be a partition of {vi, . . . , Vj} into at least 
two sets of consecutive vertices such that 

^-Cq~^'<J2f^iX.)<lcq~^'. 

veVi 

Such a partition exists thanks to the second bound on k, which ensures 
that Cq~^ < |. Let T^ be the SRT obtained from T by (see figure [1]): 

— adding a degree two vertex at height q~^ on the edge [ovi) whenever 
h{vi) < q-'\ 

— reassigning the name Vi to this new added vertex, 

— then for each / G {1, . . . , J}, merging all Vi G Vj into a new vertex 
of height q~^ , whose children are the union of all children of the Vi E Vj. 
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Figure 1. The regrouping process: left the original tree (par- 
tially represented), center the tree with added vertices, right 
the tree with regrouped branches. 



Below depth 1, the tree is unchanged and there is therefore a natural 
identification of X with the level in T^, and the distance induced by 
T^ is no larger than the original one (it can be much smaller for some 
pairs of point, and this is why we will only obtain a co-Lipschitz map). 
Another way to put it is that there is a bijective and 1-Lipschitz map 
/^ : X — )■ X^ where X^ is the level of T^. Moreover /x^ := /i/x is a 
probability measure on X^ satisfying both 



/i'(^.') < C{A:mmXly' =C-q 



when V has height q 



" with n > 1 and 
1 



-Cq-' <^^\Xl)<-Cq- 



when w is a child of the root (i.e., has height q ^). 

We can then inductively construct a sequence T'^,T^, 



rpk 



. of 

SRT by performing the same regrouping process at depth k (i.e., the 
vertices of height q~^^~^\ play the role played above by o). First this 
construction ensures that in T'^, a child v of any vertex v* of height 



h (v*) = g " for any n < k must have height h (v) = q 



-(n+i). We also 
get a system of bijective 1-Lipschitz maps f'' : X^~^ — )■ X'' where X'' is 
the ultrametric space defined by T^, and probability measures fi^ that 
satisfy 

/(^.') < C(diamX,^ 

when V has height g~" with n > k and 

1 



C-q^ 



-Cq-''' < ii'iXl) < -Cq 



when V has height q " with n < k. 
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Since T^ and T^+i are isomorphic up to depth k, there is hmiting SRT 
T°°, defining an ultrametric space X°° and there is a 1-Lipschitz map 
/ : X — )■ X°° obtained by composing all f''. This map needs not be 
bijective, because some distances may have been reduced to zero in the 
process, collapsing some points together; but it certainly is onto. 

Moreover, the probability measure f^fi = /i°° satisfies 

whenever v has depth n. This ensures that any vertex in T°° (except 
possibly the root) has at least 



children. In particular, T°° has a subtree isomorphic to the SRT of Y, 
so that there is an isometric embedding g : Y ^ X°°. 

Composing g with a right inverse of f°° (which exists at worst in the 
measurable category in virtue of a classical selection theorem), we get 
a co-Lipschitz map F — > X, which proves Theorem 11.71 (recall that our 
choice of parameter ensures dimy > s). 

5. Size of Wasserstein spaces 

5.1. The case of regular ultrametric spaces. — In the previous 
Section, we saw that ultrametric spaces contain co-Lipschitz images of 
large regular ultrametric spaces. To prove Theorem 11.31 we therefore 
have mainly left to estimate the size of Wasserstein spaces of regular 
ultrametric spaces. 

Proposition 5.1. — Given any regular compact ultrametric space Y = 
Y{k, q) and any p > 1, we have 

crit^ Wp{Y) = \ogg{k) = dimY. 

Proof. — Since the upper Minkowski dimension of Y is equal to its 
Hausdorff dimension, the upper bound ciiig^ WpiY) < dvavY is given 
by Proposition 7.4 in [[REF]]. To prove the lower bound, we shall embed 
large Banach cubes in WpiY). Fix any positive e. 

Label as usual the vertices of the SRT for Y (i.e, its balls) by the finite 
words on the letters {1, . . . , k}. A vertex v = iii2 . . .in is said to have 
depth n, has height ^g~", and there are /c" of them. Let V be the set 
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of vertices v = ii . . .in such that in < k, and define a map </? : [0, 1]^ — )■ 
j^{Y) as follows. The measure /i = ^p{{av)vev') is determined by the 
weight it gives to the balls Yy [v G V) of Y, which we define recursively 
on the depth to be: 

- KY) = 1, 

- /i(n) = /i(i;0 ■ ^^^ when V e V, 

- /i(i;) = ^i{Y^*) - ^ ^i{Yjj when v = ii . . . in-ik. 

W*=V'* ,WJ^V 

In other words, at each level we split mass almost equally between the 
children, allowing it to be a little larger than average for the k — 1 first 
children and consequently a little smaller for the last one. 

The first and second item are mandatory to get a well defined prob- 
ability measure, taking e small enough ensures that all these values are 
positive, and this construction ensures that 

whenever v has depth n. 

From ([T]) in Section [31 we deduce that for all a = {ay),b = (by) G 
[0,1]^': 

Wp(^(a), ^{b)Y >CY, 9^"" (^—^1-^) l«^' - ^''l 

where the positive constant C depends on g, k^p^e and n = n{v) is the 
depth. 

Since there are {k — Vjk"^'^ vertices of depth n in V , if we identify the 
vertices with the positive integers in a way that makes the depth function 
n non-decreasing, we can identify the sequence 

g 1 J (^0 

where v runs over the vertices with an integer-indexed sequence {am) 
where 
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It follows that there is a co-Lipschitz map from BC((m °')m) to (^(F), Wp) 
with 



As a consequence, 



crit,^(^(F), WP > crit,# BC([0, 1], 1, (n"")) ^ 



a — 1 

and letting e go to 0, we have crit-3^(<^(y), WS) > dimy/p from which 
crit,^ Wp{Y) > dimY follows. D 

5.2. The main Theorem and corollaries. — Now the proof of The- 
orem [T3] is easy: we are given a compact metric space X, and we want 
to prove 

crit^^(F) > dimX. 

If dimX = there is nothing to prove; assume otherwise and choose 
arbitrary s < s' < dimX. There is a bi-Lipschitz embedding of an 
ultrametric space X' into X with dimX' > s' (by the Mendel-Naor 
ultrametric skeleton theorem) and there is a co-Lipschitz embedding of 
a regular ultrametric space Y into X' with dim Y > s (by Theorem II. 7p . 
Composing this embeddings and applying the resulting map to measures, 
we get a co-Lipschitz embedding Wp(Y) M- Wp{X). This implies that 

CTit^WpiX) > ciiti^ WpiY) = dimY > s 

and since s < dimX is arbitrary, we finally get crit^ ;;^(X) > dimX. 

The two corollaries then follow directly. Assume X is a compact 
space; we just saw that crit^ Wp{X) > dimX, and we proved in |Klol2] 
that crit^^(X) < M-dimX. If dimX = M-dimX = d, then we 
get crit^;^(X) = d. If dimX > M-dimX', we get crit^^(X) > 
crit^ Wp'{X'), and there cannot be any bi-Lipschitz (or even co-Lipschitz) 
map from the first Wasserstein space to the second one. 



5.3. Concluding example. — Last, we prove Proposition IL6I The 
example is constructed to have infinite Minkowski dimension (the number 
of branches above height e in its SRT grows very fast when e — )■ 0) but 
small "complexity" (its SRT has countably many ends). 

Let T be the SRT such that the root has two children: a leaf wi (thus 
h{wi) = 0) and V2, which has two children: a leaf W2 and ^3, which has 
two children and so on; and such that h{o) = 1 and h{vn) = (l + lnra)"^. 
The ultrametric space X defined by T is a sequence {wi, W2, ■ ■ ■} together 
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with an accumulation point Wao, and d{wi,Wj) = (1 + lnz)~^ when 1 < 
i < j < oo. 

For any p > 1 and any ^iV (z ^{X), formula ([T]) shows that 

n>l ^ ^ 

Fix any e > 0, and let (6„) be the sequence of sum 1 such that 6„ = 
(^j^-(i+e) fQj~ some C and all n. The map from [0, 1]^ that sends (m„) to 
the measure // such that /i({w;„}) = W7,n6„ is therefore co-Lipschitz from 
BC((c,))to(^(X),Wp with 

On = 

In particular, one can restrict this map to a co-Lipschitz map with do- 
main BC((n~*-^^^'^'*) whose critical parameter is l/2£:. It follows that 
crit^ Wp{X) > ^, and this holds for all e > 0. 

In conclusion, X is a countable ultrametric space whose Wasserstein 
space has infinite power-exponential critical parameter, as claimed. 

It seems plausible that (may be at least for ultrametric spaces) the 
critical parameter of Wp{X) is bounded below by the (lower or upper?) 
Minkowski dimension of X. The strongest conjecture would be that 
crit^ ^(X) = M-dimX for all compact metric space X. However we do 
not know how to transform an ultrametric space into one where explicit 
computation are possible without loosing too much of its Minkowski di- 
mension, and general ultrametric spaces seem difficult to handle in wide 
generality. Moreover, we do not know whether there is an Ultrametric 
Skeleton Theorem with respect to Minkowski dimension (see Question 
1.11 in |MN13bj . 
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